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Abstract 

We define an analytic version of the graph property testing problem, which can be 
formulated as studying an unknown 2- variable symmetric function through sampling from 
its domain and studying the random graph obtained when using the function values as 
edge probabilities. We give a characterization of properties testable this way, and extend a 
number of results about "large graphs" to this setting. 

These results can be applied to the original graph-theoretic property testing. In particu- 
lar, we give a new combinatorial characterization of testable graph properties. Furthermore, 
we define a class of graph properties (flexible properties) which contains all the hereditary 
properties, and generalize various results of Alon, Shapira, Fischer, Newman and Stav from 
hereditary to flexible properties. 

1 Introduction 

Graph property testing is a very active area in computer science. In its most restricted form 
(and this will be our concern in this paper), it studies properties of (very large) graphs that can 
be tested by studying a randomly chosen induced subgraph of bounded size. 

To be more precise, we have to describe what kind of error is allowed. In this paper, by graph 
we always mean a finite simple graph. A graph property is a class of graphs invariant under 
isomorphism. The edit distance of two graphs G\,G2 on the same node set is \E(G\)AE(G2)\- 
The edit distance of a graph G from a graph property is the minimum number of edges we have 
to change (add or delete) to obtain a graph with the property. If no graph with the same number 
of nodes has the property, then this distance is infinite. 

Definition 1.1 A graph property V is testable, if there exists another property V' (called a 
test property) satisfying the following conditions: 

(a) if a graph G has property V, then for all 1 < k < |^(G)| at least a fraction of 2/3 of its 
fc-node induced subgraphs have property V , and 

(b) for every e > there is a k £ > 1 such that if G is a graph whose edit distance from V is 
at least e|T^(G)| 2 , then for all k e < k < \V(G)\ at most a fraction of 1/3 of the fc-node induced 
subgraphs of G have property V . 

The notion of testability has other variations: we may also know the number of nodes of G, 
or we can take a sample whose size is growing slowly with the size of G, etc. The definition 
above is in a sense the most restrictive, and it has often been referred to with adjectives like 
"oblivious testing" and " order independent testing" . Since this is the only version we consider 
in this paper, we simplify terminology by calling it simply "testable" . 

We could strengthen this definition by requiring a fraction of 1 — e instead of 2/3 and a 
fraction of e instead of 1/3. We could also weaken it by allowing the test property V' to depend 
on e. It can be seen that neither of these modifications would change the notion of testability. 

A surprisingly general sufficient condition for testability was proved by Alon and Shapira [5] : 
Every hereditary graph property is testable. (A graph property is hereditary, if whenever a graph 
has the property, then all its induced subgraphs also have the property.) Alon, Fischer, Newman 
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and Shapira pQ gave a characterization of testable graph properties in terms of Szemeredi par- 
titions (which is quite involved and we don't quote it here). In fact, Szemeredi partitions play a 
central role in most results of this theory. 

One of the main graph theoretic results in this paper is to give another combinatorial char- 
acterization of testable properties (Theorem I3.20p . It says that a graph property is testable if 
and only if for every graph with the property, a sufficiently large "typical" induced subgraph is 
"close" to having the property. 

Our main goal is, however, to treat property testing in terms of the theory of convergent 
(dense) graph sequences and graph limits [TUJ, [TTJ [T] . A sequence of graphs (G n ) is convergent 
if the density of copies of any fixed graph F in G n tends to a limit. It turns out that the 
limit of a convergent graph sequence can be represented by a symmetric measurable function 
W : [0, l] 2 — > [0, 1], and that many problems and constructions in graph theory have a simpler 
and cleaner formulation when extended to this limit (see [5] for a survey). 

Parameter testing (or estimation) is closely related to property testing, but is in many respects 
simpler. This area has a very natural treatment in the framework of graph limits [3[7]. The two 
theories are connected by a result of Fischer and Newman [5] , who proved that the edit distance 
from a testable property is a testable parameter. The analytic theory of property testing is more 
involved than the analytic theory of parameter testing, mainly because of the different type of 
error that is permitted. 

Above, we used the "edit distance" of two graphs in the definition of testable properties. 
However, there is a different distance, called the "cut distance", which plays a central role in 
graph convergence; for example, a sequence of graphs is convergent if and only if it is Cauchy 
in an appropriately normalized cut distance. The main technical issue in the analytic theory of 
property testing is the interplay between these two distances; see Section [2.51 for some auxiliary 
results of this nature that might be interesting on their own right. 

The space of limit objects (two-variable functions) with the "cut distance" is compact, a fact 
which is essentially equivalent to various (weak and strong) versions of Szemeredi's Regularity 
Lemma [llj . So while we do not explicitly use the Regularity Lemma, it is implicit in the 
utilization of the compactness of this space. 

A further surprisingly general result using the edit distance is the theorem of Alon and Stav 
[3], proving that for every hereditary property, a random graph with appropriate density is the 
farthest from the property in edit distance. The analytic results developed in this paper allow us 
to state and prove a simple analytic analogue of this fact, from which the original result follows 
along with generalizations. Similar analytic analogues are derived for the other above mentioned 
results. 

2 Preliminaries 
2.1 Homomorphisms 

For two graphs F and G, a homomorphism from F to G is an adjacency preserving map V(F) — > 
V(G). The number of such homomorphisms is denoted by hom(F, G). We'll almost always use 
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the normalized version of this number, 

hom(F, G) 



t{F,G) 



\V(G)\\ V (F)\- 



which can be interpreted as the probability that a random map V(F) — > V(G) is a homomor- 
phism. We denote by ind(-F, G) the number of those injective homomorphisms that also preserve 
non-adjacency (in other words, the number of induced copies of F in G). The normalized version 
of this number is 

ind(F.G) 
md[ ' ' " (\V(G)\)\v(f ){ 
(here (n)k = n(n — 1) • • • (n — k + 1)). 

2.2 Functions and graphons 

Let W denote the space of all symmetric measurable functions W : [0, l] 2 — ► K (i.e., W(x, y) — 
W(y, x) for all x,y £ [0, 1]). Let W denote the set of all functions W £ W such that < W < 1. 

A function W £ W is called a stepfunction, if there is a partition Si U • • • U S& of [0, 1] into 
measurable sets such that W is constant on every product set Si x Sj. The number k is the 
number of steps of W. 

Let ^-fc denote the set of all graphs on node set [k] = {1, . . . , k}. For W £ Wo and F £ J~k, 
define 

t(F,W)= [ TT W(xi,Xj)dx 

J\0,l] k ,,.-„/„N 



and 



und(F,w)= f n ^(^^i) n (i-^cxi,^))^. 



ijeE(F) HtE(F) 

Two functions Wi,W 2 £ W are isomorphic, in notation Wi = W 2 , if t(G, Wi) = i(G, W 2 ) 
for every graph G. It was proved by Borgs, Chayes and Lovasz [4] that two functions are 
isomorphic if and only if there is a third function U £ Wo & n d two measure preserving maps 
0i, 02 : [0, 1] — > [0, 1] such that Wi(x, y) = U{<j>i{x), <fii(y)) for i = 1, 2 and almost all x, y £ [0, 1]. 
An isomorphism class of functions in Wo is called a graphon [7]. 

A sequence of graphs (G n ) with |V(G„)| — > oo is called convergent, if t(F,G n ) tends to a 
limit for every fixed graph F. (This is equivalent with ti n a(F, G n ) tending to a limit for every 
F.) It was proved in [10] that for every convergent sequence of graphs (G„) there is a function 
W £ Wo such that t(F, G n ) — > i(F, IV) for every graph F. We call W the iwrwf of the sequence, 
and write G n — * VF. Every function in Wo arises as the limit of a convergent graph sequence. 
Furthermore, the limit is unique up to isomorphism. 

For every graph G, we define a function Wg £ Wo as follows. Let V(G) = {1, . . . , n} and 
consider a point [x, y) £ [0, l] 2 . Define integers i and j such that x £ ((« — l)/n,i/n] and 
y G ((j — X)/n,j/n] (if a; = we define i = 0, and similarly for j). Then we set 



W(x,y) = 



1, if ij £ E(G), 
0, otherwise. 
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(informally, we consider the adjacency matrix A — (a>ij) of G, and replace each entry ay by 
a square of size (l/n) x (1/ri) with the constant function ay on this square). Note that Wq 
depends on the labeling of the nodes of G (but only up to a measure preserving transformation) . 

2.3 Distances of graphs and functions 

As mentioned in the introduction, our results concern the interaction of two distances between 
graphs, the edit distance and the cut distance. Let Gi and G2 be two graphs with a common node 
set V. Instead of the edit distance mentioned in the introduction, we shall use its normalized 
version 

\E(Gi)AE(G 2 )\ 
cti(Oi,(_r 2 ] - pj-rp , 

and the normalized cut distance 



dn(Gi, G 2 ) = max 



e Gl (S, T) - e G2 (S,T) 



s,tcv \V\ 2 

(here ec 1 {S, T) denotes the number of edges of G with one endpoint in S and the other endpoint 
in T). 

We consider on W the cut norm 



\\W\\ n = sup / W{x 7 y)dxdy 

S,TC[0,1] JSxT 



where the supremum is taken over all measurable subsets S and T. (See [7] for several useful 
properties of this norm.) We will also use the standard L\ norm 



= / \W(x,y)\dxdy. 
J[o,i] 2 

This defines two metrics on Wo by 

d x {U,W) = \\U- W\\i, and d a (U, W) = \\U - W\\ a . 

For every set S C W and every c > 0, we define, as usual, the balls 

c) = {W € Wq : di(W, S) < c} and B n (S, c) = {W G Wq : d D (W, S) < c}. 

Clearly dp < dx, and hence Bx{S,c) C B\j(S,c). So dp is continuous with respect to dx- In 
general, dx is not continuous w.r.t. dp, but see Theorem 13.41 for a weaker statement. 
For W € Wo and : [0, 1] -> [0, 1], set W^(a;,i/) = W(<£(a;), ^(y)). We define 

5i(Z7,W) = infd^C/,^), 

where <f> ranges over all invertible measure preserving maps from [0, 1] to [0, 1]. This is a quasi- 
metric on Wo, in which U and W are distance if and only if they are isomorphic. The metric 
So{U,W) is defined analogously. 

The main advantage of over dp is that the space (W, is compact, as was proved in 

P. 
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We can use this distance to define yet another distance between graphs: 

5 a (G 1 ,G 2 ) = 8 D (W Gl ,W G2 ). 

Note that in this definition the graphs G\ and G 2 do not need to have the same number of 
nodes, and their distance is independent of the labeling of their nodes. If it happens that 
V{Gi) = V(G 2 ), then clearly 

Sa(G 1 ,G 2 ) < da(G 1: G 2 ) < d t (Gi, G 2 ). 

The paper [7J contains a more explicit description of this distance, and its relation to combinato- 
rially defined distances. One of the main conclusions is that a sequence of graphs is convergent 
if and only if it is Cauchy in this metric. So (Wo, <$□) is the completion of the set of graphs with 
distance S\j. 

We summarize some further facts about homomorphism densities and distances, mostly from 
[lOj . Let F be a graph with k nodes and m edges. For every graph G, we have 

t(F,G)=t(F,W G ), 

but for the "induced" versions we only have the following approximate equality: 

Lemma 4.1 in \W\ asserts that for any two functions U,W € Wo, 

\t(F,U)-t(F,W)\<m\\U-W\\ D . (2) 

This implies (via the functions Wg and Wr) a similar inequality for any two graphs G and H: 

\t(F,G)-t(F,H)\<mS u (G,H). (3) 

An analogue of inequality @ for induced densities in functions can be proved by essentially the 
same argument : 

\tind(F,U)-t ind (F,W)\< rj -\\U~WWo. (4) 
Using the easy inequality this implies for the induced densities in graphs that 

(assuming that \V(G)\,\V(H)\ > (*). 

The following result from [7] (Theorem 4.10) provides a converse to 

Theorem 2.1 Let U,W € Wo and let k > 1 be a positive integer. Assume that for every simple 
graph F on k nodes, we have 

\t(F,U) -t(F,W)\ < 3~ fe2 . 

Then 

22 

6n(U,W)< - 



VTogfc 
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We conclude with a lemma showing that convergence in the cut norm has good analytic 
properties. 



Lemma 2.2 Suppose that \\W n - W\\ a -> as n —> oo (W,W n € W ). Then for every Z e W 

\\Z(W n -W)\\n->0 (n->oo). 

In particular, 



Z(x,y)W n (x,y)dxdy -> / Z(x,y)W n (x,y)dxdy 
[o,i] 2 J[o,i] 2 



and 



W n -> W 
s Js 



for every measurable set S C [0, l]' 



Proof. If 2T is the indicator function of a rectangle, these conclusions follow from the definition 
of the ||.||p norm. Hence the conclusion follows for stepfunctions, since they are linear combi- 
nations of a finite number of indicator functions of rectangles. Then it follows for all integrable 
functions, since they are approximable in £i([0, l] 2 ) by stepfunctions. □ 

2.4 W-random graphs 

For every function W € Wo and every finite set X C [0,1], we define a graph G(X,U) on 
V(G) = X by connecting x, y G X, x ^ y with probability U (x, y) (making independent decisions 
for different pairs {x, y}). If G is a simple graph and X C V(G), we denote by G[X] the subgraph 
induced by X. 

For every function W £ Wo, we define the W -random graph G(n, W) = <G(X, W), where 
X ^ [0: 1] consists of n independent, uniformly distributed random points in [0, 1]. Note that 
for every F S J-„ , 

P(G(n,W) = F)=t ind (F,W). 

Clearly the distribution of G(n, W) is invariant under isomorphism of functions, i.e., it only 
depends on W as a graphon. 

We need an analogous notation G(k, G) = G[X], where G is a finite graph and X is a random 
subset of V(G) chosen uniformly from all /c-element subsets. 

It was proved in [TU] and with probability 1, G(n, W) — > W 7 , and in fact, the convergence is 
quite fast, as shown by the following concentration inequality: 

Theorem 2.3 Let W € Wo and let F be a graph with k nodes. Then for every < s < 1, 

p(\t(F,G(n,W))-t(F,W)\ >e) < 2cxp (-^n) . (6) 

The following bound on the distance of a VF-random graph from W was proved in [7] (The- 
orem 4.9(h)): 
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Theorem 2.4 Let U € Wo and let k > 1 be a positive integer. Then with probability at least 

1 - e -fe 2 /(21ogfc) 7 W£ hme 

8 n (U,G(k,U)) < -i£=. 

Vlogfc 

We'll use a related construction of a random graph associated with W. Let Xi be a uniform 
random element of the interval Li — \(i — l)/n,i/n], and X — {Xi, . . . ,X n }. Let G'(n, W) = 
G(X, W). (Note that the distribution of G(n, W) only depends on the isomorphism type of 
W, but <G'(n, W) does depend on how the points of [0, 1] are ordered.) For the cut distance of 
G'(n, W) from W we have the following bound: 

Lemma 2.5 Let W £ W and G — G'(n, W). Then with probability more than 1 — 1/yfn, 

■50 



Sn(W G >,U) < 



Vlog log n 



Proof. We use subgraph densities and Theorem 12. II To estimate \t(F, WQ/ n m) — t(F, Wq')\, 
note that the random variable t(F, Wcfn,!/)) can be generated as follows: (1) we select k random 
integers r\, . . . , r^ e [n] uniformly and independently (with repetition); (2) we select Zi G [0, 1/n] 
uniformly and independently for i = 1, . . . , k, and compute Xi — Zi + (fj — 1)/^; (3) we create 
random variables Yij (i,j £ [k]) such that Yij = Yji but otherwise they are independent, and 



Yij — 



1 with probability W(Xi, Xj), 
with probability 1 - W(X h Xj). 



Then t(F, Wc^n,//)) i s *he expectation of YiijeE(F) Yij over the choice (1). The computation of 
t(F, Wq>) is similar, except that if = rj then we choose Z± = Zj. It follows that if the integers 
ri, . . . , r,t are distinct, then t(F, Wc{ n ,u)) an d t(F, Wg<) are generated by the same procedure, 
and hence they can be coupled so that they are equal. The probability that there is no repetition 
among n,...,rk is 

n(n-l)---(n-fc + l) > x _ (a) 

n k n ' 

( k ) 

So it follows that t(F, Wq(ii, U)) = t(F, Wc) with probability at least 1 — — , and hence 

E(\t(F,W G(n:U) )-t(F,W G >)\) < ^. 
Summing this over all graphs F on k nodes, we get 

£|t(F,w G( „ iE/) )-i(^w- G 0|) < — , 

F 

and so the probability that £] F \t(F, W G („ tU j) - t(F, W&)\ > 3~ fc2 is less than 6 fc2 /"- Thus with 
probability more than 1 — 6 fe /n, we have 

-k 2 



\t(F, W G{n>u) ) - t(F, W G ,))\<3 
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for all graphs F with at most k nodes. Theorem 12.11 implies that in this case 

22 



5 D (G(n,U),G') < 



%/logfc 

So if we choose k — [\/log(?i/2)/4j , then 

44 

n,U),G')< 



Vloglogn 

Combining with we get that with probability more than 1 — \j^fn, 

S D (W G <,U) < 6 n (G'Mn,U)) + 5 n (W G{niU) ,U) < -£L= + ~ ir ^= < ™ 



Vlogn Vl°g log n Vl°g log " 

□ 

One advantage of using G'(n,W) over G(n,W) is that it is easier to handle its d\ distance 
from other graphs and functions. 

Lemma 2.6 Let W e W and G' = G'(n, W). Let H be a graph on [n]. Then 

E(di(Gf,ff)) = WWn-Wh 

and 

E(||^-W r || 1 )<2||W ff -W||i. 

Proof. For the first formula, note that for 1 < i,j < n, the probability that a pair i ^ j 
contributes to d\(H,G') is 

[l-WiX^Xj), if ijeE(G n ). 

Summing over all i ^ j, and taking expectation, the equality follows. The second inequality is 
an easy consequence: 

E(||Wg'-W||i) < E^W G >-WHh) + \\W H -W\\i = E(d 1 (G' ,H)) + \\W H -W\\ 1 < 2||W ff -W||i. 

□ 

As a useful consequence, we obtain the following fact. The Regularity Lemma implies (see 
e.g. [TT]) that functions in Wo can be approximated by functions of the form Wg, so that the 
number of nodes of G can be bounded uniformly if the error is measured in the cut distance. 
Obviously, one cannot approximate all functions by functions Wg in the Li-norm. But for every 
n there is graph on n nodes that approximates W so that the cut distance tends to uniformly, 
and at the same time the approximation in the L\ norm is almost as good as possible. 

Corollary 2.7 Let G be a simple graph and U G Wo- Then there exists a simple graph G' such 
that 

50 

||WG'-Ef||i<4||W G -Ef||i and 6 n (W G >,U)< 



Vlog log n 
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Proof. Let V{G) = [n]. We are going to show that the random graph G' = &'(n, W) satisfies 
the conditions with positive probability. By Lemma I231 E(di(W' G , Wj) < 2\\Wg — W\\i, and so 
with probability at least 1/2, 

dx(W G ,,W)<4\\W G -W\\ x . (7) 
On the other hand, Lemma 12.51 implies that with probability at least 3/4, we have 

S a (G',U)< . (8) 

Vloglogn 

So with positive probability, both ([7]) and © hold. □ 
2.5 Relating different norms 

As pointed out in the introduction, the analytic problem behind property testing is to relate the 
cut norm and the Li norm. In this section, which contains our main technical tools, we study 
this connection. Some of the lemmas below are purely analytic in nature, and may be of interest 
on their own right. 

Lemma 2.8 Suppose that \\U n -U\\u -> and \\W n -W\\ a ^Oasinoo (U,W,U n ,W n G Wo). 
Then 

]imw£\\W n -U n \\i > \\W-UWl 

n — *oo 

Proof. Let a(x,y) = sgn(U(x,y) — W(x,u)). Then 

\\U n -W n \\i> / a(x,y)(U n (x,y) -W n (x,y))dxdy 
J[o,i] 2 

a(x, y){U{x 7 y) - W(x, y)) dx dy = \\U - W\\ x 

by Lemma O □ 

There are easy examples showing that \\W n — U n \\i — ► \\W — U\\i does not hold in general: 
for example, let W n = WG( n> i/2) and W = U = U n = 1/2. But we can formulate two statements 
that provide partial converses to this lemma. 

Lemma 2.9 Suppose that \\U n -U\\ a ->■ and \\W n -W\\ a as n -> oo (U,W, U n , W n G W ). 
Suppose further that U is — 1 valued. Then 

\\U n -W n \\i -» \\u-w\\ v 

Proof. By Lemma [2^21 

||£^-W„||i < WUn-U^ + WU-WnWx 

U n + f (1 -U n )+ [ W n + f (1 - W n ) 

u=o Ju=i Ju=o JU=1 



W+ I (1-W) = \\U-W\\i. 

U=0 JU=1 

Combined with Lemma [2~81 the assertion follows. □ 
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Lemma 2.10 Suppose that \\U n - U\\u — *• asn^oo (U, U n £ Wo ). Then for every W £ Wo 
there is a sequence of functions W n £ Wo such that \\W n — W\\q — > and 

llf/n-Wnlli^UfZ-WHi. 
Proof. First we consider the case when U > W. Let 

Z(x,y) 

and W n = ZC/„. Trivially W n £W ,W = ZU, and 



W(x,y)/U(x,y) iiU(x,y)>0, 
otherwise. 



||VK-^ n || n = j|Z([/-CX, l )|| n ^0 

by Lemma 12.21 Furthermore, 

||l7n- Wnlli = \\(U-W) + Z(U-U n )\\ 1 < \\U-W\\x + \\Z{U-U n )\\ x ||[/- W||i 

(using Lemma E2] again). Combining this with Lemma l2.8l we get that ||C/n — W n \\i — > ||I7— W||i. 

The case when U <W follows by a similar argument, replacing U,W, . . . by 1 — U", 1 — W, 

Finally, in the general case, consider the function V — max({7, W). Then clearly \\U — V\\\ + 
|| V — W\\i = \\U — W\\\. Since U < V, there exists a sequence (V^) of functions such that 
\\V n - V\\n -> and ||K - C/„||i -> || V - f7|| x . Since V > W, there is a sequence (W„) of 
functions such that ||W„ - W||n ->• and \\W n - V n \\x -»■ || W - V\\i. Hence 

limsup||£/ n - W n \\i < limsup||f/„ - V n \\i + limsup||V n - W n \\i 

= \\U-V\\ 1 + \\V-W\\ 1 = \\U-W\\ 1 . 

Using Lemma [2T51 again, the lemma follows. □ 

A fact similar to Lemma [2~8l holds for the distances Si and <Jp replacing the norms ||.||i and 



||.||rj. This does not seem to follow directly from Lemma 12.81 and the proof is more involved. 

Lemma 2.11 Suppose that Sa(U n ,U) -> and S a (W n ,W) -> as n -> oo (£7, W, U n , W n £ 
Wo)- Then 

liminf 5i(W„, C/„) > *i(W, (7). (9) 

Proof. Let d denote the lim inf on the left hand side of ([9]). Let e be an arbitrary positive 
number. There is a number k (depending on U, W and e), a partition [0, 1] = U^ =1 Si of the unit 
interval into k measurable pieces and two functions W, U' £ Wo such that both W' and U' are 
constant on every rectangle Si x Sj and furthermore that HW 7 — W'||i, \\U — U'\\i < e. Let n be 
a natural number such that \\W n — W\\a , ||?7 TC — < e/fc 4 and #i(W„, J7„) < d + e. There 
are measure preserving transformations p, tt : [0, 1] > [0, 1] such that || — < d + 2e. Let 

Si.j denote the set S? n SJ for every 1 < i, j < k. It is clear that Sij is a partition of the unit 
interval and that both W' p and U' n are constant on each rectangle Si Xt j 1 x Si 2 j 2 . We have that 



d+2 e >||W*-C£|| 1 > J2 / W-^) 



11 



Using that 



we get that 



(W - W p ) 



Si 1 j 1 xSi 2 j 2 



< and 



(17* - Ul) 



Si 1 ,j 1 x Si 2 j 2 



< 



fc 4 ' 



E 

I<*ll3l,i2,j2<* 



(W - IT) 



<d + 4e. 



^il ,31 X ,32 

Writing W - U n = (W ~ W' p ) + (W' p - U /7T ) + {If' 71 - U*) we obtain that 



E |/ {W-U«)\> E (|/ 



l<-»l,jl,l2,j2<fc " S< l'il XSi 2.J2 



^il Jl x ■ J2 



- w' p ) 



Si x 7 j ± X Si 2 



(W' p - 17") 



This implies that 



E |/ (w-u*)|> s 1/ 



l<n Jl,*2 ,J*2<fc Si l'3'l XSi 2,32 



(W' p - u' n ) 



l<*l,il,»2,Ja<fc 'l'" XSi 2,32 



-2e. 



Using that both VU and {/ are constant on the sets Stiji x Si 2 ,j 2 we obtain that the right 
side of the above inequality is equal to \\W' P — U' n \\i — 2e. Consequently 

\\W' P - U'* ]]!<<!+ 6e. 

Using that \\W - tU' p ||i, \\U* - J7 ,7r ||i < e we get that 

\\W P - U*h <d+lOe. 



Since e > is arbitrary, |9]) follows. 



□ 



3 Main results 

3.1 Property testing for functions 

We define a notion of testability for properties of functions in Wo and for graphons. Formally, a 
function property is a subset 1Z C Wo; a graphon property is a function property that is invariant 
under isomorphism. A function property is closed if it is closed in the j|.||n norm. 

Example 3.1 The function property of being 0-1 valued is a graphon property by the charac- 
terization of isomorphism, but it is not closed, since for a sequence of random graphs G n with 
edge probability 1/2, the functions Wc n are 0-1 valued, but their limit in the ||.||p norm, namely 
the identically 1/2 function, is not. 

The following definition of testability of a function property is analogous to the testability of 
a graph property. The framework is that we study a function W € Wo by observing a TU-random 
graph G(k,W). 
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Definition 3.2 A function property TZ is testable if there is a graph property TZ' (called a test 
property for TZ) such that 

(a) P(G(fc, W) G TZ') > 2/3 for every function W e TZ and every fc > 1, and 

(b) for every e > there is a k E > 1 such that P(G(fc, IV) € 72-') < 1/3 for every k> k e and 
every function We W with di(W,72.) > e. 

Similarly as for graph properties, the constants 1/3 and 2/3 are arbitrary, but it would not 
change the property if we replaced them by any two real numbers < a < b < 1 : 

Lemma 3.3 Let < a < b < 1. A graphon property TZ is testable if and only if there is a 
graph property TZ" such that for every e > there is a constant k(e) such that for every function 
W G W andk> k(e), 

(a) if W eTl, then P(G(k,W) G TZ") > b, and 

(b) ifd x (W,n) > e then P(G(k,W) € TZ") < a. 

Proof. Suppose that TZ is testable, and let TZ' be the graph property in the definition of 
testability. For every simple graph F and k < \V(F)\, define 

q k (F) = P(G(k,F)eH'). 

Define the graph property 

TZ" = {G : q k (F) > I}. 
Let W € TZ and let n > k be large enough. Then 

E(q k (G(n,W))) = P(G(k,W) e TZ') > | 

We use Azuma's inequality to show that qf.(G(n,W)) is highly concentrated. Indeed, let 
Xt, ...,X n S [0,1] be independent uniform samples, and let Z t — E t (qk{G(n,W))) , where 
E t means conditional expectation with respect to the choice of X± , . . . , X t and the randomiza- 
tion of the edges between nodes {1, . . . , t}. Then Zq, . . . , Z n is a martingale, with expectation 
E(Zo) = E(<7fc(G(n, W))) > 2/3. Furthermore, \Z t +\ — Z t \ is bounded by the probability that a 
random fc-subset of {1, . . . , n} contains t + 1, which is k/n. Thus by Azuma's Inequality, 

p(«*(G(n, W)) < i) < p(q k (G(n,W)) < E(q k (G(n,W))) - i) < ex P (-^). 
Choosing n large enough, this probability will be less than 1 — 6, and hence 

P(G(n,W) £ TZ") > b. 

So (a) is satisfied. The proof of (b) is analogous, and so is the proof of the converse. □ 

It follows from the definition that a graphon property is testable if and only if its closure in 
the ||.||i norm is testable. Furthermore, the closure in the norm of a testable property is 
testable (but not the other way around, see example 13. 6f d) below). It follows from Theorem l3.4l 
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below (but it is not hard to see directly too) that if TZ is testable, then its closures in the ||.|ji 
norm and ||.||n norm coincide. 

Since the distribution of G(k, W) is preserved under isomorphism of W, every closed testable 
function property is a graphon property. In our applications to graph theory only closed graphon 
properties will play a role, and we'll focus on characterizing testability for such properties. 

It is trivial that Bi(TZ, e) C B\j(TZ, e) for every TZ C W and e > 0. A reverse containment 
characterizes testable graphon properties. 

Theorem 3.4 A graphon property TZ is testable if and only if for every e > there is an e' > 
such that B n (Tl, e') C B x (11, e) . 

Proof. Suppose that TZ is testable with test property 7Z'. Let e > 0, let k = k(e) be the constant 
in the definition, and let e' = 2~ k . Suppose that for some W S Wo, we have da(W,TZ) < e' . 
Then there is a U £ 1Z such that d\j(W, U) < e' . By {!]), we have for every graph F on k nodes 

\Und(F,W)-t iud (F,U)\ < Q)e'- 

Thus 

|P(G(*, W) i K') - P(G(*, U)£1Z')\< l^ d ( F ' W ) ~ l ^ U )\ ^ 2 ® Ci) £ ' < I' 

Hence P(G(n, ^ < P(G(n, 17) ^ + | < |, which proves that di(W, < e. 
Conversely, suppose that 7?. satisfies the condition in the proposition. Choose 

k n = l;^\ogn, e n = n~ 1/3 . 

Let 1Z' be the graph property that a graph G with |V(Cr)| = n has if and only if there exists a 
U e TZ such that \t(F, U) - t(F, G)\ < e n for all graphs F with \V(F)\ < k n . We show that (a) 
and (b) are satisfied. 

First, suppose that W £ TZ. Let n be large, and let F be a graph with at most k n nodes. By 
Theorem^ we have \t(F, G(n, W)) - t(F, W)\ < e n with probability at least 




Since the number of graphs on at most k n nodes is at most exp(fc^/2), the probability that 
\t(F, G(ri, W)) — t(F, W)\ < e n holds for every graph F on at most k n nodes is at least 

if n is large enough. Since W £ TZ, in every such case G(n, W) £ TZ' . This proves that TZ' 
satisfies (a). 

Second, let e > and suppose that di(W,TZ) > e. By hypothesis, there is an e' > 
(depending only on e) such that da(W,TZ) > e' . 
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Let n be large, and suppose, by way of contradiction, that with probability larger than 1/3, 
we have G(n, W) £ 1Z' . In every such case, there exists a function U £ 1Z such that 



\t(F,U)-t(F,G(n,W))\<e n (10) 

for all graphs F with at most k n nodes. Similarly as above, Theorem 2.5 in [TU] implies that 

\t(F,G(n,W)) - t(F,W)\ <e n (11) 

for all F with at most k n nodes with probability at least 2/3. There is an outcome for G(n, W) 
for which both (fTU|) and (|11[) occur, and so there always exists a function [/ £ 1Z such that 

|t(F,W)-t(F,E0| < 2e„ 

for all graphs F with at most fc n nodes. By Theorem 3.6 in [7], it follows that 

22 

d a (U,W)< " 



V /1 °S2 k n 

If n is large enough, this is less that e' , contradicting the definition of e' . □ 

We also prove the following characterization, which is a functional analogue of the charac- 
terization of Alon, Fischer, Newman and Shapira [T]. 

Theorem 3.5 A graphon property TZ is testable if and only if for every e > there is an e' > 
and a finite set S of stepfunctions such that 1Z C B\j(S, e') C B\{TZ, e). 

This theorem gives a "constructive" method of testing a testable graphon property: for every 
fixed error bound, it suffices to compute the da distance from a finite number of stepfunctions 
to separate the case when W £ 1Z from the case when di(W, 1Z) > e. 

Proof. First, suppose that 1Z is testable, and let e > 0. By Theorem 13.41 there is an 
ef > such that Ba(1Z, 2e') C B±(lZ,s). For every stepfunction s, consider the open ball 
Ba(s,e'). These balls cover the whole space, so by the compactness of (W, 5a) there is a 
finite set So of stepfunctions such that the corresponding balls cover the whole space. Let S 
be the set of those stepfunctions in for which the corresponding balls intersect 1Z. Then 
clearly 1Z C £?□(£, e'). On the other hand, Ba(U,e') intersects 1Z for every U £ S, and hence 
B u {S,s')<ZB u {n,2e')^B x {n,e). 

Second, suppose that 1Z satisfies the condition in the theorem, then for every e > there 
exists an e' > and a finite set S of stepfunctions such that 1Z C Ba(S,e') C Bi(TZ,e/2). Let 
e" = ee'/6. We claim that 

SnCSy + e'OcB^Bn^eO.e/Z). (12) 

Indeed, let PF G S n (5,e'+e"). Then there is a U £ S such that \\U - W\\a < e' + 2e". Consider 
Y = (1- |e)W+|eJ7. Then 

||y - £%i = ||(1 - ^e)(C/ - W)\\ u < (1 - ^)(e' + 2e") < ef, 
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so Y G B\j(S,e'). On the other hand, 

\\W-Y\\ 1 = \\±e(W-U)\\ 1 <±e<±e, 

and so W € Bi(Bfj(S,e'),e/2). This proves fl!2p. which in turn implies that 

B n (72,e") C Bp(B p (S,e , ),e") C B 1 (B p (5,e').e/2) C Bi{B x {1l, e/2), e/2) = £?i(72, e). 

This proves that 72. is testable. □ 
We conclude this section with some examples of testable and non-testable function properties. 

Example 3.6 (a) Let 72 = {U}, where U G W is the identically 1/2 function. Clearly this is 
invariant under isomorphism. Consider a random graph G n — G(n, 1/2); then || Wg„ — U\\\z\ — > 
with probability 1, but ||Wg„ — £7||i = 1/2 for every n. So this property is not testable by 
Theorem l3Tl 

On the other hand, the complementary property 72 c = Wo \ {U} is testable; indeed, its 
closure is Wo (either in the ||.||p norm or the ||.||i norm), which is trivially testable. 

(b) Let S C Wo be an arbitrary graphon property and let a > be an arbitrary number. 
Then 72 = B\j(S, a) is testable. 

Indeed, for e > define e' = oe/(2 - 2e). Let IF € B u {H,e'). Then IF € Sp(«S,a + e'), 
and so there is a U G 5 such that ||f7 - W||p < a + 2e'. Consider F = (1 - e)IF + eC/. Then 
ll y - U\\u = 11(1 - e){U - W)\\ u < (1 - + 2e') = a, so 7 e R. On the other hand, 
= ||e(W- E/)lli < e, and so W G Bi(ft,e). 

(c) For every fixed graph F and < c < 1, the property 72. that t(F, IF) = c is testable; an 
appropriate test property is 

^' = {G: | t (F >G )-4<_^}. (13) 
Indeed, it follows from Theorem 12.31 that with probability 1 — o(l), 

|*(F,W)-t(F,G(n,WO)l < r^- 

vogn 

So if i (F, IF) = c, then G(n, IF) G 72.' with large probability. Conversely, assume that di (IF, 72.) > 
e, and let (say) t(F,W) > c. The functions U s = (1 - s)IF, < s < e, are all in Bi(IF,e), 
and hence not in 72. It follows that t(F,U s ) > c for all s. On the other hand, t(F,U e ) — 
(1 - e)\ E ^k(F,W), and so t(F,W) > (1 - £ )-l^(^)l c , w hich implies that G(n, IF) $ 72' with 
large probability. 

Fixing two subgraph densities, however, may yield a non-testable property: for example, 
t(K 2 , IF) = 1/2 and t(C 4 , IF) = 1/16 imply that IF = 1/2 (see [6]). 

(d) The graphon property that IF is 0-1 valued is not testable. It is closed in the ||.||i norm, 
but its closure in the ||.||p norm is the whole set Wo- 



16 



3.2 Graph properties vs. function properties 
3.2.1 Closure of graph properties 

We want to establish the connection between testability of graph properties and graphon proper- 
ties. The fact that graphons arise as limits of graph sequences suggests the following definition. 

Definition 3.7 If V is a graph property, then we define its closure V as the set of all functions 
W € Wo for which there exists a sequence of graphs G n G V with |V(G n )| — ► oo such that 
G n — ► W (i.e., Wg„ converges to W in the <5p metric). 

Clearly V is closed under isomorphism, i.e. it is a graphon property. The following examples 
show that V is not necessarily an "extension" of V in the sense that V cannot be recovered from 
it. Intuitively, V is a nice object which is a "clean" version of V\ it is an analytic profile of the 
property V, which eliminates all uncontrollable noise from it. 

Example 3.8 (a) Let V be the graph property that the graph doesn't have a 4-cycle. Then 
only the function has property V . In fact, graphs without 4-cycles are sparse and property 
testing (in the sense of Definition II. 1| does not distinguish sparse graphs from each other. 

(b) Let V be the graph property that the graph has an even number of edges. Rather 
counter-intuitively, this property is testable according to Definition II .11 above, and its closure is 
the whole set Wo- 

(c) Let V be the graph property that the graph has an even number of nodes. This property 
is not testable, since adding a single node to a large graph changes the distribution of small 
induced subgraphs by very little, but changes the property. The closure of this property is again 
the whole set Wo (which is testable). 

(d) Quasirandomness is defined as a property of a sequence of graphs, but we can make it a 
graph property Q (at the cost of a somewhat arbitrary choice of the error bound) as follows: a 
graph G on n nodes is quasirandom, if 



1 



2 



1 1 



t(K 2 ,G) - - < and i(C 4 , G) < — 



logn 16 logn 



The closure Q of this property consists of only one function, the identically 1/2 function. This 
singleton set of functions is not testable, since for any sequence (G„) of quasirandom graphs, 
|| W G „ - |||n -> but \\W Gn - = 1/2. This implies (by Theorem I3T81 below) that quasiran- 
domness is not a testable property. 

The first part of the following fact was stated in [5] . 

Proposition 3.9 (a) The closure of a hereditary graph property V consists of those functions 
W G Wo for which 

P(G{k,W) <£V) = 
for every k. Equivalently, t mt i{F, W) = whenever F <£ V . 
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(b) The closure of a testable graph property V consists of those functions W € Wo for which 

E(di(G(fc ) W),7>))->0 (fc^oo). 
It follows from Theorem 13.51 that in the last formula, we could replace d\ by drj. 

Proof, (a) Assume that P(G(fc, W) i V) = for every k. Since G(k, W) -> W with probability 
1 as k — > oo, it follows that W € V. 

To show the converse, assume that W £ V, and let (G„) be a sequence of graphs such that 
G„ € "P and G n — ► W. We can write 

P(G(fc,W)gP)= ^ P(G(A,W0 = -F')= E *i»d(F,W), (14) 
Fer k \v Fer k \v 

so it suffices to prove that t- m d(F, W) = for F £ V . By we have 

|tind(F,G„) -tindC^WcJI < — ^L- (15) 

n - (a) 

for every F £ Tk- For F G JFfc \ P, we have £i n d(-F, G„) = since G n £ V and the property is 
hereditary. Hence by f|15[> . in this case 

t ind {F 7 W Gn ) ^ (n->oo). 

But G n — > VF implies that 

iind^WcJ^tind^W), 

which proves (a). 

(b) The argument is similar, but a little more complicated. First, assume that 
E(di(G(fc, W), V)) — > as k — > oo. Then we can select a sequence fci < /c 2 < . . . of inte- 
gers such that 

P(d 1 (G(fc m ,W),P)>i-) <-L. 

Since the right hand sides have a finite sum, it follows by the Borel-Cantelli Lemma that 
di(G(k m ,W),V) — > with probability 1. Hence with probability 1, there are graphs H m 6 V 
such that di(G(k m ,W),H m ) — ► 0, which implies that 5i(G(fc m , VF), i? m ) — > 0. Furthermore, if 
the sequence (k m ) is sufficiently sparse, we have G(fc m , W) — > IF with probability 1 as fc — » oo, 
and so 7J m — » JF with probability 1. Thus W £ "P. 

Second, assume that W £ P, and let (G n ) be a sequence of graphs such that G„ € T 3 and 
G n — > VF. Fix any e > 0. By the remark after the definition of property testing, there is a 
graph property V 1 and a natural number fc E such that for all k,n with k e < k < \V(G n )\ we 
have P(G(fe,G„) $ V) < e, but P(G(fe,G) £ V) > 1/2 for all graphs G with \V{G)\ > k and 
d 1 (G,V) > e. By dUl) we have 

P(G(k,W)<£V')= ^nd(F,W) 

< J2 t ind(F,G n )+ Yl \tind(F,G n )-t ind (F,W Gn )\ 

+ Yj \tind{F,W Gn )-t ind {F,W)\. 
FeFkYP' 
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Here the first sum is P(G(fc, G n ) </ V) < e. The other two sums tend to as n — > oo by (fT5|) 
and by G„ -> W. So for k > k , we get that P(G(fc, W) </ V) < e. 

Note that G(k,W) has the same distribution as G(fc,G(n, W)) for n > k. Then for large 
enough k, n we have 

P(G(fc,W) £ P') > ipfaCGfoW),?) > e), 
and so P(d x (G(n, W), P') > e) < 2e. Hence 

E(d x (G(n, W), 7?)) < (1 - 2e) • e + (2e) • 1< 3e. 

Since e was arbitrary, this proves that E(di(G(n,W),V)) — ► 0. □ 
Part (a) of Proposition 13.91 implies: 

Corollary 3.10 If V is a hereditary graph property, then W € V depends only on the support 
ofW. 

We conclude with two lemmas on testability. We'll say more about both ( Theorems 13 . 201 and 
I3.18j) . but these simple lemmas will be needed before that. 

Lemma 3.11 Let V be a testable graph property. Then for every e > there is an e' > 
and a positive integer n' such that if G are two graphs with G' € V , 6\j(G,G') < s' and 
\V(G)\, \V{G')\ > n', then d x {G,V) < e. 

Proof. Let e > and let k = k(e) in the definition of testability. We show that n' = 9k 2 
and e' = 1/k 2 is a good choice. Since G' has property V, we have that at least 2/3 of its 
fc-node induced subgraphs have property V' . It follows by © that more than 1/3 of the fc-node 
subgraphs of G have the property V' . Hence di(G,V) < s by testability. □ 

Lemma 3.12 IfV is testable then V is testable. 

The converse is not true in general, but in Theorem 13.181 we will give a characterization of 
testable properties in terms of their closure. 

Proof. It suffices to prove that if (W n ) is a sequence of functions in Wo such that du(W n ,V) — > 
0, then di(W n ,V) — > 0. We may assume that the sequence W n is convergent, so W n — > U for 
some U £ Wo (in the 6\j distance). Clearly U G V, so by the definition of closure, there are 
graphs H„ £ V such that |V(£T n )| — > oo and H n — * U. 

Fix any e > 0. By Lemma 13.111 there is an e' > such that if |V(G)|, | V(i?)] are large 
enough, H £ V, and So{G,H) < e', then di(G,V) < e. Furthermore, there is an n e > 1 such 
that if n> n £ , then 5 a (W Hn ,U),5 a (W n ,U) < e'/3. 

Fix any n > n e , and let G nym (m = 1, 2, . . . ) be a sequence of graphs such that |V(G n ,m)| ~~ * 
oo and G n>m — > W n as m — > oo. Then 

fa(if„, G„, m ) < J n (Wif„, C/) + fa(f/, W n ) + 5 D (W„, W G „ im ) < e' 
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if m is large enough, hence by the choice of e' , we have di(G n ,m,'P) < e. This means that there 
are graphs J n . m G V with V(J n . m ) = F(G„, m ) such that di(G n , m , J n ,m) < £■ By choosing a 
subsequence, we can assume that J„ jm — ► U n as m — * oo for some U n £ V. Applying Lemma 
12.111 we obtain that 

di(W n ,V) < SxiWn, U n ) < liminf 5 x {W Gn m , W Jn J < liminf d 1 (G n , m , J n , n ) < e. 

m — >oo ' rn — >oo 

This proves the Lemma. □ 



3.2.2 Closure and distance 

What is the relationship between the d\ distance from a property and from its closure? The 
following propositions summarize what we know. We say that a graph G' is an equitable m- 
blowup of G if it is obtained by replacing each node of G by m or m + 1 twin copies (m > 1). 

Proposition 3.13 (a) For every hereditary graph property V and every graph G, 

di(G,V) <di(W G ,P). 

(b) For every testable graph property V , 

\di{G,V)-di{W G ,V)\^Q {\V{G)\^oo). 

(c) Let V be an arbitrary graph property and let G^G 2 , . . . be all equitable blowups of a graph 
G. Then 

limm{di(G n ,V) = di(W G ,V). 

Proof, (a) Let S > and U £ V be such that \\W G - U\\x < d^WcV) + 5. Using Corollary 
13.101 we may assume that U is a — 1-valued function. From the fact that G(n, U) has property 
V with probability 1, it follows that G' = G'(n, U) has property V with probability 1. By Lemma 

E(d 1 (G,G')) = \\Wg-U\U <d 1 {W G ,V) + 5. 

Hence there is an instance of G' for which G' G V and d 1 {G,G') < dt(W G ,V) + 6. Thus 
di{G,V) < d±(G, G') < dxiWcV) + 5. Since S is arbitrary, this proves (a). 

(b) Suppose not, then there exists a sequence of graphs (G n ) with |V(G„)| — > oo such that 
di(G n ,'P) — > a and d 1 (W Gn ,V) — * 6, where a ^= b. We may assume that V(G„) = [<7„], where 
g„ -> oo. 

First, select graphs ff„ G "P such that V(if n ) = V(G„) and d\{G n ,H n ) = d(G n ,V). By 
selecting a subsequence, we may assume that the sequence H n is convergent; let U G Wo be its 
limit. Clearly U G V. Then fa(VF H „,P) < 5 (W Hn ,U) -> 0, and hence rf n (Wff n! P) -> 0. By 
Theorem 13.41 this implies that o?i(T / l / ^f rl , V) — > 0. But then in the inequality 

di (W Gn , V) < di {W Gn ,W H J + dx {W Hn , V) 
the first term on the right hand side tends to a, while the second tends to 0, showing that b < a. 
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Second, fix e > 0, and let U n G V be such that d 1 (W Gn ,U n ) < d 1 (W Gn ,T>) + e. By the 
definition of testability of functions, there is an to > 1 such that for to > too and every n, we 
have that G(m, U n ) G 7-" with probability at least 2/3. 

Similarly as in the proof of (a), consider the random graph G' = G'(q n , U n ). Clearly 

d l (G n) P)<d 1 (G n ,G?) + d 1 (G',P). (16) 

Here by Lemma [5^] 

E(d 1 (G n ,G')) = d 1 (W G „,U n ), (17) 
Next we show that, with probability tending to 1 as n — > oo, we have 

di(G n ,G')<di(W Gn ,U n ) + e. (18) 

Indeed, the left hand side is the sum of q n /2 independent random variables, all — 1 valued, so 
this follows by the Law of Large Numbers. 

To estimate the other term in (TT6|) , let Z be a random m-element subset of X, and let Y 
be a random TO-elements subset of [0, 1]. The distributions of Z and Y are very close; indeed, 
we can generate Z by generating Y and accepting it if its elements fall into different intervals 
[i/q n , (i + l)/q n ], i = 0, . . . ,q n — 1, and only regenerate otherwise. The probability that we 
need to regenerate tends to as n — > oo, so the total variation distance of Z and Y tends to 
0. The probability that G(Y, U n ) € V is at least 2/3 as V is testable by Lemma [3.12[ and so 
the probability that &(Z, U n ) € V is at least 1/2. But G(Z, U n ) has the same distribution as a 
random m-node subgraph of G', and so by testability, 

P(di(G',V) <e) + ip^CCP) > e) >\- 

This implies that with probability at least 1/4, 

di(G',V)<e. (19) 

With positive probability, both (fT5)) and P^|) occur, and so by (jTSJ) we have 

dx{G n ,V) < dx{G n ,G') + dxiG' ,V) < d^W^ , U n ) + e + e < di(W Gn ,P) + 3e. 

Sending n — > oo we see that a < b + 3e. Since e was arbitrary, it follows that a < 6, which is a 
contradiction. 

(c) Let e > 0, and let [/ G P be a function such that ||Wg - U\\i < di(W G , U) + e. Let H n 
be a sequence of graphs such that H n — > [/ and i? ra € P. Then for an appropriate labeling of 
the nodes of H„, we have || Wh„ — U\\o — > 0. Since W G is — 1 valued, Lemma \23\ implies that 
\\W H „ -Wg||i ~> ||^- W G ||i. Let V(G) = {l,...,fe} and V(H n ) = {l,...,m}. Choose n large 
enough so that ||Wjy„ — Wg||i < \\U — W G \\% +e and to > k/e. 

Let Vi = { — l)m/fcj + 1, . . . , [im/k\ } for i = 1, . . . ,k. Then {V\, ■ ■ ■ , Vk) is a partition 
of V{H n ) into fc almost equal classes. Define a graph G' on {1, ... , k} by connecting m e 7, to 
w G Vj ; if and only if ij G E(G). Then G" is an equitable blowup of G. Furthermore, W G > and 
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Wq differ only on stripes of width less than 1/m along the orders of the squares on which W G 
is constant, so \\Wg' — Wg\\i < 2fc/m < 2s. Thus 

di(G',V) <dx{G',H n ) = \\W g >-WhJ\i < \\W G > - W G \\i + \\W G - W H Ji 
<2e+ (\\U- Wg||i + e) < d x {W G ,V) + 4e. 

Since e was arbitrary, this proves that 

liminfdi(G"\7>) < tZi(W G ,P). 

To prove the converse, let a — liminf di(G n ,'P), and let H n 6 ? be chosen so that V(H n ) — 
V(G n ) and di(H n ,G n ) = di(G n ,V). Select a subsequence such d\(H n — G n ) — > a, and choose 
a further subsequence so that H n is convergent. Let H n —> U ^V. We have Su{Wh„, C/) — > 
and So{Wg", Wg) — * 0, hence by Lemma [2~TT1 

^i(Wg, U) < Uminf 6i(W Hn , W Gn ) < a. 

Hence d 1 {W G ,V) < 5i{W G ,U) < a. □ 

Proposition 3.14 Let V be. any graph property and G n —> W , a convergent graph sequence. 
Then 

liminf dAG^V) > dAW.V), 

Proof. Let H n e V be such that V{G n ) = V(H n ) and di(G n ,H n ) = di(G n ,P). We 
may select a subsequence so that H n — > [/ for some U £ Wq. Clearly U £ V. Furthermore, 
\\W Gn ~ W\\n -> and \\W Hn - U\\ u -> 0, so by LemmadHl we have 



di(W,V) < \\W- U\\i < liminf ||W G „ - W H J\i = limin£ d i (G n ,V ), 
a contradiction. □ 



3.2.3 Monotone closure 

For two functions U, W £ Wo we write U -< W if there exist functions U',W £ W such that 
U = U', W = W, and [/' < W almost everywhere. 

Let V a graph property. By its upward closure we mean the graph property consisting 
of those graphs that have a spanning subgraph in V. For a function property 1Z, we define its 
upward closure to consist of those functions W £ Wq for which there exists a function U £ 1Z 
such that U . In both versions, the downward closure is defined analogously. 

The following theorem, whose proof is surprisingly nontrivial, asserts that closure and upward 
closure commute. 



Theorem 3.15 For every graph property V , 
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Proof. First, let g Then there exists a graph sequence G n — > W £ Wo such that G n 
has a spanning subgraph G^ £P. We consider the pair (G n , G^) as the graph G„ in which the 
edges of G' n colored red, the remaining edges are colored blue. We may choose a subsequence 
of the indices so that the remaining sequence is convergent as 2- edge- colored graphs, meaning 
that for every 2-edge-colored simple graph F, the sequence of densities t(F, G n ) of the densities 
of color-preserving homomorphisms is convergent. It is shown in [12] that the limit object of a 
such a sequence can be described by a pair of functions U, V £ Wo, such that V is the limit of 
the sequence (G n ), U is the limit of the sequence (G' n ), and U < V almost everywhere. Hence 
V = W and U e V. This proves that W € 

Conversely, let W £ fP) T , then there is a U £ V and a V ^ W such that U < V. Let 
G n £ V be such that G n — > [/. By Lemma 4.16 in [7], we may label the graphs G„ so that 
|| W Gn — U\\\j —> 0. Let C = C n denote the partition of [0, 1] into N — |V(G n )| equal intervals 
{Ii, . . . , In}, an d f° r W € W, let Wc denote the function obtained by replacing W by its average 
on each of the intervals Ii x L . Clearly Uc < Vc, and 

\\U-U c \\ n ->0, \\V-Vc\\ n ^0 (20) 

(this holds even with the Li-norm in place of the cut norm), and so 

\\U C - W G Jo ^0. (21) 



Define 



W n = V c + ^-^(W Gn ~U c ) 
I — Uc 



By Lemma [272 



(where the second term is wherever Uc = 1)- Then 

W n <V c + ]-^(l-Uc) = L 

and 

n C \ 1-Uc ) l-Uc - uc \ l-U c ) l-U c " 

\\K-Vdn= ^— ^r(W Gn -Uc) (n-oo). (22) 

Since is a stepfunction that is constant on intervals I x J (J, Jg £), it can be viewed as Wff„ 
for some weighted graph H n on [AT]. Create a random graph G' n as follows: for 1 < i < j < N, 
connect i and j with probability equal to weight of the edge in H n . Lemma 4.3 in [7] implies 
that with probability at least 1 — e~ N , 

\\W G > n -W H Jn<^S=, 
v Jy 



and hence 

\\W G i n -W^\\a^0 (n->oo). (23) 
Trivially, G„ is a subgraph of G' n with probability 1, and ([2H)l . (|2ip and (j2"B"|) imply that 

II Wb; - V|| n < || W G ; - <lb + IK - V c \\a + \\V C - V\\ a 0. 
This proves that V eV^, and so W £ . □ 
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3.2.4 Robustness 



The following definition will be important in our characterization of testable graph properties. 

Definition 3.16 A graph property V is robust, if for every e > there are numbers n = n(e) > 
and s' > such that if G is a graph with \V(G)\ > n(e) and d^WcV) < e' , then d\(G,V) < s. 

Proposition l3.13T b) implies that every testable property, in particular every hereditary prop- 
erty, is robust. The following fact, which is an immediate consequence of Proposition 13.131 
provides a combinatorial criterion for robustness. 

Proposition 3.17 A graph property V is robust if and only if for every e > there is an e' > 
and an n e > 1 such that if G is a graph with \V(G)\ > n E and G has infinitely many equitable 
blowups G' with di(G',V) < e' , then d x {G,V) < e. 

3.2.5 Characterizing testability of graph properties 

Our next main theorem shows the relationship between analytic and graph theoretic testability. 

Theorem 3.18 A graph property V is testable if and only if it is robust and its closure V is 
testable. 

Proof. We have seen that every testable graph property is robust (Proposition 13. 13T b)). and 
that its closure is testable (Lemma I3.12p . So to complete the proof, it suffices to prove that if V 
is robust and V is testable then V is testable. 

Let e > 0. By the robustness of V, there is an e' > and k £ > 1 such that if |V(G)| > k e 
and di(Wc,'P) < e', than di(G,V) < e. By the testability of V and Theorem 13.41 there is an 
e" > such that if d (W,V) < e" then di(W,V) < e' . By the definition of V and by Theorem 
2.9 in 0, there exists an n £ > k e such that 

(i) for every graph G eV with \V(G)\ > n e , we have d D (W G ,V) < e"/4; 

(ii) for every graph G with |V(G)| > n e and every n £ < m < \V(G)\, we have 
S n (G(m,G),G) < e"/4 with probability at least 2/3. 

Let V denote the property of a graph G that da^WcV) < e"/2 (this depends on e, but as 
we remarked after the definition, this is OK). We claim that V is a good test property for V 
(for the given e). 

Let G be a graph, n e < n < \V(G)\, and let H — G(n,G). First, suppose that G E V. By 
(i), d a (W G ,T) < e"/4. Further by (ii), 6 a {H,G) = 6 a {W H ,W G ) < e" /A with probability at 
least 2/3. In every such case, 

d D (W H ,V) < 5 n (W H ,W G ) + d a (W G ,V) < e"/2. 

Thus H has property V . 

Second, suppose that d\{G, V) > e; we want to prove by contradiction that H does not have 
the property V with probability at least 2/3. Assume that it is not true, then with probability 
larger than 1/3, d a (W H ,V) < e"/2. We have d a (W G , W H ) < e"/ 4 with probability more than 
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2/3. This implies that there exists at least one induced subgraph H of G with n nodes such that 
d a {W G , W H ) < e"/4 and d D (W H ,V) < e"/2. We obtain that d a (W G ,T) < e" . It follows from 
the testability of V that di(Wc, V) < e'. It follows from the robustness of V that di(G,V) < e, 
a contradiction. □ 

A further connection between testable graph properties and graphon properties is the follow- 
ing fact: 

Theorem 3.19 For every closed testable graphon property TZ there exists a testable graph prop- 
erty V such that TZ = V. 

Proof. Let e > 0. Since TZ is testable, it follows by Theorem 13.41 that there is and e' > 
such that B\j(TZ,e') C Bi(lZ,s). The set TZ, with the distance function is a compact metric 
space, and hence it can be covered by a finite number of balls Bi = B\j(Wi, e' /2), i = 1, . . . , m. 
For every 1 < i < m, there is a smallest positive integer rii such that if n > rij then there is a 
graph G with n nodes such that Wg £ Bi. Let n e = max^ rij. So for every U G TZ, and n > n e 
there is a graph G with n nodes such that (using the i for which Bi 3 U), 

Sa(W G , U) < S D {W G , Wi) + 6 D (W t , U) < £ - + £ - = e' < e. 

This also implies that 5i(Wg,7Z) < e. 
For every n > 1, let 

/ 50 , 1N 

E n = max =, supje : n e >n|). 

Vl°g log n 

Clearly e n \ as n — > oo, and e„ e > e. We prove that the property V — {G : Si(Wg,TZ) < 
e |v(G)|} is robust and its closure is TZ. 

First we show that R D P. Indeed, if W € P, then there is a sequence G„ € V with 
|V(G n )| -> oo such that S a (W Gn ,W) -> 0. Since G n e V means that (5 D (W / G „,^) < 
Si(Wg„,TZ) < £|v(g„)| — * 0, it follows that 5n(W,72.) = 0. Since 7?. is closed, this implies 
that W G TZ. 

To show that KCP, consider any U G TZ, and let e > 0. As we have seen above, there is 
a graph G S "P with n e nodes such that 5d(M / g, U) < s and <5i(Wg,7?.) < £ < £« e - So G e V, 
which shows that there is a function Wg with G £ V arbitrarily close to U. 

To show that property V is robust, consider any graph G with n nodes. Choose U € TZ so 
that ||Wg — < 2di(W / G, TZ). By Corollary 12.71 there is a graph G on n nodes such that 
di(G, U) < 4|| W G - t/||i < 8di(W G ,^) and ||Wg - tf|| n < Vio ^ ogn < e„. Thus GeV, and 

di(G,P) < di(G,G) = \\W G - W s \\! < \\W G - U\\i + \\G - U\\ x 
< 2d 1 {W G ,TZ) +8d 1 {W G ,TZ) = Wd^WcTZ). 

Thus V is robust and V = TZ is testable. Theorem 13. 181 implies that T 3 is testable. □ 

Let us say that two graph properties V and V' are equivalent if their closure is the same. 
Theorem 13.191 implies that equivalence classes of testable graph properties are in a one to one 
correspondence with the testable graphon properties. 
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As an application of these results, we give a purely combinatorial characterization of testabil- 
ity, which generalizes the result of Alon and Shapira on the testability of hereditary properties, 
and also contains a finite analogue of Theorem 13.51 

Theorem 3.20 For a graph property V , the following are equivalent: 
(a) V is testable; 

(Jo) For every e > there is an e' > and an n' > such that if G G V, G is any other 
graph such that S a (G,G') < e' , and \V(G)\, \V(G')\ > ri , then di(G' ,V) < e. 

(c) For every e > there is an Eq > and an no > such that i/G € ? and G' is an induced 
subgraph of G such that S\j(G,G') < £o and \V{G')\ > no, then di(G',V) < e. 

Remark 3.21 Condition (b) says that, roughly speaking, if a graph G is close to a graph H e V 
in the distance, then it is also close to a (possibly different) graph J € V in edit distance. 
But we need to be careful: Let V be the (trivial) graph property of having at most 1 node, then 
a large edgeless graph will be close to V in the 6\j distance, but not in d\. The theorem shows 
that it is enough to add the assumption that the graphs are large enough. 

Condition (c) is a weakening of Alon-Shapira condition that the graph is hereditary. Theorem 
2.11 in [7] implies that a randomly chosen fc-node induced subgraph of G is closer to G than 
10/^/log k in the S\j distance, with large probability. So we can think of induced subgraphs G' 
satisfying Sa(G,G') < sq as "typical". 

Proof. Lemma f3. Ill savs that (a)=>(b), and (b)=>(c) is trivial. To prove that (c)=>(a), we start 
with proving a version of the condition in the theorem for functions. 

Claim 3.21.1 For every e > there is an e\ > and an n\ > such that ifW^V and G is 
graph such that \V(G)\ > rii, t ind (G, W) > and d a (W G , W) < e\, then di(G,T>) < e. 

Given e > 0, let £o an d no be as in the condition of the theorem, and set e± = £o/2, 
m = no. Let H n e V be a sequence of graphs such that H n — > W. Then tmd(G,H n ) — » 
Und(G, W) > 0, so for large enough n, G is an induced subgraph of H n . Furthermore, we have 
5 a (G,H n ) < S a (W G , W) + 5 D (W,W Hn ) S a (W G , W) < e /2, and so for large enough n we 
have 5a(G,H n ) < so. So the condition of the theorem implies that di(G,'P) < e. This proves 
Claim IMUJ 

To prove that V is testable, we use Theorem l3.18l it suffices to prove that V is testable and V 
is robust. To prove that V is testable, we use Theorem l3.4l We want to prove that if a function 
is close to V in the ||.||p norm, then it is also close in the norm. Our next step is proving a 
special case of this. 

Claim 3.21.2 For every e > there is an £2 > such that if W € V and U S W is a function 
such that U — W wherever W(x,y) G {0, 1} and d a (W : U) < e 2 , then di(U,V) < e. 

Let £1 and n\ be as in Claim |3~.21.U and set £2 = £i/2. Choose a sequence G n of graphs 
such that G n ^ U and t- m d(G n , U) > 0. (For example, random graphs G n — G(n, U) have this 
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property with large probability.) By the condition on U, we have ti n<x (G n , W) > 0. Furthermore, 
8o(Wa n ,W) < 6 n (W Gn ,U) + 5 D (U,W) -► 5 U {U,W) < £i/2, and hence for large enough n, 
we have Sa(Wc n , W) < e x . If we also choose n large enough so that |V(G„)| > i%i, then 
d x (G n ,V) < e by ClaimEHnj Thus there exist graphs H n £ V with V(H n ) = V(G n ) such that 
d x (G n , H n ) < e. We choose a subsequence such that H n — ► W' for some function W', which is 
then in P. Now Lemma [2.111 implies that S X (U, W) < e, and hence di(U,'P) < e. 

Claim 3.21.3 V is testable. 

We use Theorem l3.41 Let (U n ) be a sequence of functions in Wo such that da(U n , V) — > 0, we 
want to prove that di(U n ,V) — > 0. We may assume that (£/„) is convergent in the S x distance, 
and hence applying appropriate measure preserving transformations, we may assume that there 
is a function W eWo such that \\U n - W\\a -* 0. Let 



Let e > and choose ri2, £2 as in Claim [372 1.21 with input e/2. Just as in the proof of Theorem 
13.301 we see that \\U n — U' n \\i — > 0, so we can choose a large enough n such that ||Z7„ — C/,'J|i < 
min{e/2,e 2 /2} and also \\U n - W\\a < £2/2. Then U' n = W wherever W G {0, 1} and \\U' n - 
W\\ u <W' n - UnWu + \\U n - W\\ n < \\K - U n \\i + \\U n - W\\ n < e 2 , and so by ClaimfMLH 
we have \\U' n - W\\i < e/2. Thus \\U n -W\\i< \\K - U n \\i + \\U n - W\\i < e. 

Claim 3.21.4 V is robust. 

Let e > 0, let £1 and ni be chosen so that they satisfy Claim |3~.21.1I with input e/2, and 
let z' = min(e/4, £i/3). Let G be a graph with |V(G)| = m > m and di(WG,V) < e' . Let 
W £ V be a function such that \\Wg ~ W\\i < e' . Consider the random graph G' = G'(n, W). 
Then t- m d{G' ,W) > with probability 1, and by Lemma [2.51 with probability tending to 1, 
\\Wg' — W\\u < s' if n is large enough. Furthermore, Lemma [2.61 implies that E(di(G, G')) = 
\\Wg — W\\i < e', and so with probability at least 1/2, we have di(G, G') < 2e' < e/2. In such 
a case 

\\W G - W\\ n <\\W G - WcWu + \\W G ' - W||n < \\W a ~ W G >\\i + W& -W\\ n < 3e' = £1, 

and so by Claim [5213 we have di(G',V) < e/2. Hence d x {G,V) < d x (G, G') + d x {G' , V) < e, 
which proves that V is robust. 

Using Theorem 13.181 this completes the proof of Theorem 13.201 □ 

3.3 Property testing vs. parameter testing 

Parameter testing is a problem related to property testing, and in some respects simpler; but the 
main facts are often analogous. It was introduced in [7j, where a number of different character- 
izations were also given (see also [S]). We summarize the main results about parameter testing, 
to point out this analogy; finally, we prove a direct connection between these notions. 




W(x,y), if 14^,2,) €{0,1}, 
U n {x,y), otherwise. 
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A graph parameter is a function defined on isomorphism types of graphs. A graph parameter 
/ is testable, if for every e > there is a positive integer k = k(e) such that if G is a graph with 
at least k nodes and S is a random subset of k nodes of G (chosen uniformly over all fc-element 
sets), then 

P(|/(G)-/(G[5])|>e)<l/3. 
Testability of parameters is related to the convergence of graph sequences: 

Proposition 3.22 (|7J) A graph parameter f is testable if and only if f{G n ) converges for 
every convergent graph sequence (G n ). 

A graphon functional is a real valued function defined on graphons; equivalently, a functional 
defined on Wo that is invariant under isomorphism. A graphon functional / is testable, if there 
is a graph parameter g such that for every e > there is a positive integer k(e) such that for 
every W € Wo and k > k(e), 

P(\f(W) - g(G(k,W))\ >e) < 1/3. 

Informally, we can estimate the value of f(W) by generating a VT"-random graph with sufficiently 
many nodes, and evaluating the graph parameter g on this. 
Testable graphon parameters have a simple characterization: 

Proposition 3.23 A graphon parameter is testable if and only if it is continuous in the norm 

1Mb- 

Proof. Suppose that / is testable. If \\U — W\\\j is small enough, then the distributions of 
G(k,W) and G(k,U) are close, and so there is a graph F such that |/(W) — g(F)\ < e and 
|/(77) - g(F)\ < £, so \f{W) - f(U)\ < 2e. So / is continuous. 

Conversely, if / is continuous, then it is uniformly continuous by the compactness of W, and 
so for every e > there is an e' > such that if 5n(U, W) < e' then \f(U) - f{W)\ < e' . Define 
g(F) = f{Wp) for a graph F, By Theorem 12 .41 if k is large enough, then with large probability, 

Sa(W G{k , w) ,W)<s', 

and hence with large probability, 

\g{G(k,W)) - f(W)\ = \f(W G{k , w) ) - f(W)\ < e. 

This proves that / is testable. □ 
Theorem 5.1 in 7] implies the following. 

Proposition 3.24 A graph parameter is testable if and only if there is a testable graphon pa- 
rameter f such that f(W G ) - g(G) -»0if \V(G)\ -> oo. 
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We can view this last fact as follows: Every graphon functional / gives rise to a graph param- 
eter / by f(G) — /(Wg)- This graph parameter / is testable if / is testable, and testable graph 
parameters are exactly those that are "asymptotically equal" to / for some testable graphon 
functional /. 

Alon and Shapira [5] proved that the edit distance from every hereditary graph property is 
a testable parameter. More generally, Fischer and Newman [H] proved: 

Theorem 3.25 (Fischer— Newman) A graph property V is testable if and only if the edit 
distance di(G^V) is a testable parameter. 

The following theorem gives an analogue of this result for graphons, from which Theorem 
13.251 can be deduced: 

Theorem 3.26 A graphon property TZ is testable if and only if the distance TZ) is a testable 
functional. 

The content of the theorem is that the d\ distance from a testable property is continuous 
function in the ||.||n norm. It is trivial that this distance is continuous in the ||.||i norm for 
any graphon property. The second half of the proof below shows that the d\ distance from any 
graphon property is lower semi-continuous in the ||.||n norm. 

Proof. It is easy to see that if the functional d\(.,TV) is testable then TZ is testable: the set 
{W : di(W,lZ) < e} is open in the ||.||n norm, and contains the compact set TZ, so it contains 
a neighborhood Bu(TZ, e') of TZ for some e' > 0. 

Now suppose that 1Z is testable. Let W G Wo and let W n — » W. We claim that d\ (W n ,TZ) — > 
d 1 (W,TZ). We may assume that \\W n - W\\ D -> 0. 

Let e > 0, and let U € TZ be such that \\W - U\\i < di(W,K) + e. By LemmaOyjl there is 
a sequence of functions U n £ W such that \\U n — U\\a — * and \\U n — W n \\i — > \\U — W\\i. By 
the testability of TZ and by Theorem 13.41 it follows that \\U n — U\\i — > 0, and so 

||W» - t/||i < \\W n - U n \\l + \\U n -U\\i->\\U- W\\i. 

Hence 

lhnswpdtiW^IZ) < limsup \\W n - U\\i < ||I7-W||i <di(W,^)+e. 

n — >oo n — ^oo 

Since e > is arbitrary, this implies that 

lim sup di (W n ,n)<d t (W, TZ) (24) 

n — >oo 

To prove the reverse, let V n G TZ be chosen so that \\W n — V n \\ < d\(W n ,TZ) + 1/n. By 
selecting a subsequence, we may assume that the sequence (V„) is convergent in the 6q distance. 
Let V G Wo be its limit. Clearly V G TZ. Thus by Lemma I^TTl we have 

di(W,TZ) < 5i{W,V) < limmf<5i(W n ,F„) < liminf d x {W n , TZ). (25) 

n — kxd n — yoo 

The relations (f2"4")l and (|25p prove that di(W n ,TZ) —> di(W,TZ), and so d\{.,TZ) is continuous. 

□ 
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The theorem of Fischer and Newman (Theorem I3.25[) is easy to derive from here. By the 
results of [7] Theorem 6.1(d), it suffices to prove that for every testable property V, di(W,V) is 
a continuous function of W in the ||.||p norm, and d\{G,V) — di(Wc,V) — > if |V(G)| — > oo. 
The first assertion follows from Theorem 13. 26i the second, from Proposition 13.13T b) . 

3.4 Flexible properties 

Let W € Wo- A function U € Wo is a flexing of W if U(x,y) = y) for all x,y with 

W(x, y) € {0, 1} (so we may change the values of W that are strictly between and 1; note 
that we may change these to or 1, so the relation is not symmetric). We say that a function 
property is flexible if it is preserved under flexing. The following Proposition gives some sufficient 
conditions for flexibility; the proofs are straightforward and omitted. 

Proposition 3.27 (a) Every function property that implies that the function is — 1 valued is 
flexible. 

(b) For every graph F, the function property {W € Wo : t(F, W) — 0} is flexible. 

(c) IfTZisa flexible function property, then the function properties 1Z* = {1 — W : W € TZ}, 
TZ' and TZ^ are flexible. 

(d) The intersection and union of any set of flexible properties is flexible. 
We call a graph property flexible if its closure is flexible. 

Example 3.28 (a) The graph property that G is clique of size [~|V(G)|/2] , together with isolated 
nodes, is flexible, since its closure consists of a single graphon (represented by the function W 
that is 1 if x,y < 1/2 and otherwise). 

(b) The graph property that u>(G) > | V(G)|/2 is flexible (where w(G) is the size of the largest 
clique in G), since it is the upward closure of the property in (a). Similarly, the property that 
a (G) > \V(G)\/2 is flexible (where a(G) is the size of the largest independent set in G). 

(c) The graph property that there is a labeling of the nodes by {1, . . . , n} such that two nodes 
are connected if and only if their labels sum to at most n, is flexible. Indeed, the closure of this 
graph property consists of a single graphon (represented by the function W that is 1 if x + y < 1 
and otherwise). This closure is flexible by Proposition 13. 27f a) . 

(d) The graph property that there is a labeling of the nodes by {1, . . . , n} such that all pairs 
whose labels sum to at most n are connected by an edge, is flexible. Indeed, this is the upward 
closure of the property in (c) . 

These and other examples follow from the following proposition: 

Proposition 3.29 (a) Every graph property V for which V consists of 0—1 valued functions is 
flexible. 

(b) If a graph property is flexible, then so are its upward and downward closures. 

(c) If a graph property V is flexible, then so is the property obtained by complementing all 
graphs in V . 
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(d) Every hereditary graph property is flexible. 



Proof, (a) is obvious, (b) follows from Theorem 13 . 1 51 and Proposition 13.271 (c). Assertion (c) 
follows from Proposition 13.271 (c). Finally, (d) follows from 13.91 fa), since if t(F,W) = 0, then 
t(F, U) = for every flexing U of W. □ 

Our main result about flexible properties is the following. 

Theorem 3.30 (a) Every closed flexible graphon property is testable. 
(b) Every robust flexible graph property is testable. 

Remark 3.31 1. It is important to assume that the property is closed. We have seen that the 
function property of being 0-1 valued is flexible and it is trivially a graphon property, but it is 
not testable. 

2. We can weaken this notion slightly in a way that we preserve testability. We say that 
a function property is weakly flexible if it is closed under those flexings that do not change the 
integral of the function. A good example for this is the property which consists of those functions 
whose integral is 1/2. One can modify the proof of Theorem 13.301 to show that every weakly 
flexible function property is testable. 

Proof. The second assertion is an immediate consequence of the first, so we only prove (a). 

Assume that dn(W n ,1Z) — > but di(W n ,TZ) > e for some fixed e > 0. We can assume that 
W n converges to some W e K in the ||.|| n norm. Let S Q = W _1 (0) , S x = W _1 (l) and let 
Z n G Wo denote the function which is 1 on Si, on So and is identical with W n anywhere else. 
By flexibility, we have Z n 6 1Z. By Lemma [2~2l 

lim \\W n - Z n \\x = lim f / W n + [ (1 - W n )) = [ W+ [ (1 - W) = 0, 

which is a contradiction. □ 

The theorem of Alon and Shapira 2\ follows easily. Let V be a hereditary graph property. 
By Proposition 13 . 291 V is flexible, and so by Theorem 13.301 its closure is testable. So it suffices 
to prove that V is robust, which follows from Proposition 13.13( b) . 

Let U, W £ Wo and consider a convex combination Z = all + (1 — o/)W, < a < 1. Then 
both U and W are flexings of Z. This implies the following very useful observation: 

Proposition 3.32 If 1Z is a flexible function property, then Wo \ TZ is a convex set. □ 

Corollary 3.33 The distance d\{U,TZ) from a flexible function property is a concave function 
ofU. □ 

Alon and Stav [3] proved the surprising fact that for every hereditary graph property V there 
is a number p, < p < 1, such that the random graph &(n,p) is, asymptotically, at maximum 
edit distance from the property: 
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Theorem 3.34 (Alon and Stav) For every hereditary graph property V there is a number p, 
< p < 1, such that for every graph G with \V{G)\ = n, 

di{G,V) < E(d 1 (G(n,p),V))+o{l) (n oo). 

The following theorem states a functional version and a generalization of this fact. 

Theorem 3.35 (a) For every flexible graphon property 1Z, the maximum of d\{.,1V) is attained 
by a constant function. 

(b) For every flexible and robust graph property V there is a number p, < p < 1, such that 
for every graph G with \V(G)\ = n, 

di{G,V) < E(di(G(n,p),7>)) + o(l) (n ^ oo). 

By Proposition I3.29f d). Theorem 13.341 is a consequence. A further corollary is that the 
conclusion of Theorem 13.351 holds for the properties in Example I3.28f a),fb). 

Proof. Part (a) of the theorem is an easy consequence of Proposition 13. 321 and Corollarv l3.33l 
(The proof in fact works for any norm on W instead of the norm ||.||i.) 

To prove part (b), let p £ [0,1] maximize di(p,V) (where p also denotes the constant p 
function), and let b denote the maximum value. It suffices to prove that 

E(di(G(n,p),P))>6 + o(l) (n ->«>), (26) 

and 

di(G,V)< 6 + o(l) (\V(G)\ - oo). (27) 

The first bound is quite easy: We can fix a choice G n for the random graphs so that 
d x {G n ,V) < E(di(G(n,p),V)) + o(l) and G n -> p. Then by Proposition EH we have 

liminf di(G n ,V) > di(p,V) = b. 

Then there exists a sequence of graphs G n with |V(£r n )| — * oo such 

di(G n ,V) -» 6' > b. 
W G Wo- By part (a), 

Since the property V is robust and flexible, it is testable, and hence by Proposition 13. 13f b) . we 
have di{G n ,V) - d 1 (W Gn ,V) -> 0. Thus d 1 (W Gn -J 5 ) -> b'. Theorem E26] then implies that 
di(W, V) — b' , a contradiction. □ 



Suppose that (f27|) fails, 
that 

We may assume that G n — 
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4 Concluding remarks 



We have mentioned after the definition of testable properties that some modifications in the 
definition do not change the notion of testability. Let us discuss some other possible modifications 
that would lead to a different, generally less interesting notion. 

Examples I3.8f b~) and (c) suggest that in the definition of of the edit distance, we could allow 
adding or removing nodes as well as adding or removing edges. This would of course change 
which graph properties are testable, but would not change the closure of testable properties, due 
to Theorem 15301 

Example l3.8f d) is counterintuitive again, since the densities in the definition can be estimated 
from samples easily. The trouble is that a small error in these densities only implies that the 
graph is close to a quasirandom graph in the d\j distance, not in d±. 

Considering how useful the cut distance has been, why don't we measure the error in cut 
distance? The answer is that this would lead to a trivial notion: Every graphon property is 
testable if we measure the error in the distance da- Indeed, given a graphon property TZ, we 
can define TZ' as the property of the graph G that d a (W G ,TZ) < 10/^/log \V(G)\. If W G Tl, 
then G = G{n,W) satisfies S\j(Wg,W) < 10/y%g |V(G)| with large probability, and in such 
cases G € W. Conversely, if da(W,H) > e, then for G = G(k,W) with k > 2 8 / £2 we have 
8ei{Wg,W) < e/2 with large probability; if this happens, then we must have d^WcTZ) > s/2 
(else it would follow that d\j(W, TZ) < e), and so if k is large enough, we have G G TZ' . 

We get a different and potentially interesting notion of testing function properties if instead 
of G(n, W), we consider the edge- weighted graph H(n, W) in which we select n random points 
X\, . . . ,X n from [0, 1], but instead of randomizing a second time to get the edges, we keep 
W(Xi,Xj) as the weight of the edge ij. The singleton set TZ — {U = 1/2} would become 
testable in this notion. We don't have a characterization of testability in this sense. 
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